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Heisenberg’s uncertainty principle was originally posed for the limit of the accuracy of simultaneous mea¬ 
surement of non-commuting observables as stating that canonically conjugate observables can be measured 
simultaneously only with the constraint that the product of their mean errors should be no less than a limit set 
by Planck’s constant. However, Heisenberg with the subsequent completion by Kennard has long been credited 
only with a constraint for state preparation represented by the product of the standard deviations. Here, we show 
that Heisenberg actually proved the constraint for the accuracy of simultaneous measurement but assuming an 
obsolete postulate for quantum mechanics. This assumption, known as the repeatability hypothesis, formu¬ 
lated explicitly by von Neumann and Schrbdinger, was broadly accepted until the 1970s, but abandoned in the 
1980s, when completely general quantum measurement theory was established. We also survey the author’s re¬ 
cent proposal for a universally valid reformulation of Heisenberg’s uncertainty principle under the most general 
assumption on quantum measurement. 
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I. INTRODUCTION 

The uncertainty principle proposed by Heisenberg^ in 1927 
revealed that we cannot determine both position and momen¬ 
tum of a particle simultaneously in microscopic scale as stat¬ 
ing “the more precisely the position is determined, the less 
precisely the momentum is known, and conversely”— (p. 64), 
and had overturned the deterministic world view based on 
the Newtonian mechanics. By the famous y ray microscope 
thought experiment Heisenberg^ derived the relation 

e{q)e{p)'^h (1) 

for e{q), the “mean error” of the position measurement, and 
e(p), thereby caused “discontinuous change” of the momen¬ 
tum, or more generally the mean error of the simultaneous 
momentum measurement, where h is Planck’s constant: 

Let e{q) [originally, ^i] be the precision with 
which the value q is known (e(^) is, say, the mean 
error of q), therefore here the wavelength of the 
light. Let e{p) [originally, p\\ be the precision 
with which the value p is determinable; that is, 
here, the discontinuous change of p in the Comp¬ 
ton effect^ (p. 64). 

Heisenberg claimed that this relation is a “straightforward 
mathematical consequence”— (p. 65) of fundamental postu¬ 
lates for quantum mechanics. In his mathematical derivation 
of relation ([T]i, he derived 

= \ ( 2 ) 

for standard deviations o{q) and (j(p) of position q and 
momentum p for a class of Gaussian wave functions, later 
known as minimum uncertainty wave packets. Subsequently, 
Kennard^ proved the inequality 

(y{q)(y{p)>^^ (3) 


for arbitrary wave functions. By this relation, the lower bound 
of relation O was later set as 

eiq)e{p) > ( 4 ) 

where Ti = h/{2n). 

Text books^"— up to the 1960s often explained that the 
physical meaning of Heisenberg’s uncertainty principle is ex¬ 
pressed by Eq. (HI, but it is formally expressed by Eq. Q. This 
explanation is later considered to be confusing. In fact, it was 
said that Eq. ([Hi expresses a limitation of measurements, while 
mathematically derived relation Eq. (IS expresses a statistical 
property of quantum state, or a limitation of state preparations, 
so that they have different meanings^. Thus, Heisenberg with 
the subsequent completion by Kennard has long been cred¬ 
ited only with a constraint for state preparation represented by 
Eq. dS. 

This paper aims to resolve this long standing confusion. It 
will be shown that Heisenberg! in 1927 actually “proved” not 
only Eq. (|S but also Eq. ([TJ from basic postulates for quan¬ 
tum mechanics. In showing that, it is pointed out that as 
one of the basic postulates Heisenberg supposed an assump¬ 
tion called the “repeatability hypothesis”, which is now con¬ 
sidered to be obsolete. In fact, in the 1930’s the repeatabil¬ 
ity hypothesis was explicitly claimed by von Neumann! and 
Schrbdinger—, whereas this hypothesis was abandoned in the 
1980s, when quantum measurement theory was establish to be 
general enough to treat all the physically realizable measure¬ 
ments. 

Through those examinations it will be concluded that 
Heisenberg’s uncertainty principle expressed by Eq. (01 is log¬ 
ically a straightforward consequence of Eq. (O under a gener¬ 
alized form of the repeatability hypothesis. In fact, under the 
repeatability hypothesis a measurement is required to prepare 
the state with a sharp value of the measured observable, and 
hence the “measuremental” uncertainty relation (0]) is a logical 
consequence of the “preparational” uncertainty relation (|3]l. 

As stated above, the repeatability hypothesis was aban¬ 
doned in the 1980s, and nowadays relation (0|i is taken to be 
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a breakable Naturally, the problem remains: what 

is the unbreakable constraint for simultaneous measurements 
of non-commuting observables? To answer this question, we 
will survey the author’s recent proposaUiril for a universally 
valid reformulation of Heisenberg’s uncertainty principle un¬ 
der the most general assumption on quantum measurement. 

II. REPEATABILITY HYPOTHESIS 

The uncertainty principle was introduced by Heisenberg in 
a paper entitled Uber den anschaulichen Inhalt der quanten- 
theoretischen Kinematik und Mechanik^ published in 1927D 
In what follows we shall examine Heisenberg’s derivation of 
the uncertainty principle following this paper. 

Before examining the detail of Heisenberg’s derivation, we 
shall examine the basic postulates for quantum mechanics in 
Heisenberg’s time, following von Neumann’s formulation^. 
In what follows, a positive operator on a Hilbert space with 
unit trace is called a density operator. We denote by .^(R) 
the set of Borel subsets of R and by the spectral measure 
of a self-adjoint operator A, i.e., A has the spectral decompo¬ 
sition A = f^XE^{dX). 

Axiom 1 (States and observables). Every quantum system S 
is described by a Hilbert space Jif called the state space of 
S. States o/S are represented by density operators on Jif and 
observables o/S are represented by self-adjoint operators on 

jr. 

Axiom 2 (Born statistical formula). If an observable A is 
measured in a state p, the outcome obeys the probability dis¬ 
tribution of A in p defined by 

Pr{AGA||p} = Tr[£^(A)p], (5) 

where A G .^(R). 

Axiom 3 (Time evolution). Suppose that a system S is an 
isolated system with the (time-independent) Hamiltonian H 
from time t to t -\~T. The system S is in a state p{t) at time t if 
and only ifS is in the state p (f -f t) at time t -\-T satisfying 

p{t + T) = e-‘^^/’'p{t)e‘^^/’'. ( 6 ) 

Under the above axioms, we can make a probabilistic pre¬ 
diction of the result of a future measurement from the knowl¬ 
edge about the past state. However, such a prediction applies 
only to a single measurement in the future. If we make many 
measurements successively, we need another axiom to deter¬ 
mine the state after each measurement. For this purpose, the 
following axiom was broadly accepted in the 1930s. 

Axiom 4 (Measurement axiom). If an observable A is mea¬ 
sured in a system S to obtain the outcome a, then the system S 
is left in an eigenstate of A belonging to a. 

Von Neuamann^ showed that this assumption is equiv¬ 
alent to the following assumption called the repeatability 
hypothesis^ (p. 335), posed with a clear operational condition 


generalizing a feature of the Compton-Simons experiment^ 
(pp. 212-214). 

(R) Repeatability hypothesis. If an observable A is mea¬ 
sured twice in succession in a system S, then we get the same 
value each time. 

It can be seen from the following definition of measurement 
due to Schrodinger given in his famous “cat paradox” paper— 
that von Neumann’s repeatability hypothesis was broadly ac¬ 
cepted in the 1930s. 

The systematically arranged interaction of two 
systems (measured object and measuring instru¬ 
ment) is called a measurement on the first sys¬ 
tem, if a directly-sensible variable feature of the 
second (pointer position) is always reproduced 
within certain error limits when the process is im¬ 
mediately repeated (on the same object, which in 
the meantime must not be exposed to any addi¬ 
tional influences)^. 

Based on the repeatability hypothesis von Neumann^ 
proved the impossibility of simultaneous measurement of two 
non-commuting observables as follows. Suppose that two ob¬ 
servables A,Z? are simultaneously measurable in every state 
and suppose that the eigenvalues of A are non-degenerate. 
Then, the state just after the simultaneous measurement of A 
and B is a common eigenstate of A and B, so that there is an 
orthonormal basis consisting of common eigenstates of A and 
B, concluding that A and B commute. 

Since Heisenberg’s uncertainty principle concerns mea¬ 
surements with errors, it is naturally expected that it can be 
mathematically derived by extending the above argument to 
approximate measurements. 


III. APPROXIMATE REPEATABILITY HYPOTHESIS 

To extend the repeatability hypothesis to approximate mea¬ 
surements, we generalize the notion of eigenstates as follows. 
For any real number / and a positive number e, a (vector) 
state \j/ is called an e-approximate eigenstate belonging to / 
iff the relation 

WAxp-XxpW < e (7) 

holds. If e = 0, the notion of e-approximate eigenstates is 
reduced to the ordinary notion of eigenstates. A real number 
/ is called an approximate eigenvalue of an observable A iff 
for every e > 0 there exists an e-approximate eigenstate of 
A. The set of approximate eigenvalues of an observable A 
coincides with the spectrum of Aii (p. 52). 

Now, we formulate the approximate repeatability hypothe¬ 
sis as follows. 

(AR) Approximate Repeatability Hypothesis. If an ob¬ 
servable A is measured in a system S with mean error e 
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to obtain the outcome a, then the system S is left in an e- 
approximate eigenstate of A belonging to a. 

Obviously, (AR) is reduced to (R) for £ = 0. Since we have 

||Ar-Av/||> ||Av/-(A)v/|| = (t(A) 

for any real number X, where (A) = (xj/jAy/), (AR) implies 
the following statement; If an observable A in a system S is 
measured with mean error £(A), then the post-measurement 
standard deviation (7(A) of A satisfies 

(7(A) < £(A). (8) 


IV. HEISENBERG’S DERIVATION OE THE 
UNCERTAINTY PRINCIPLE 


Heisenberg’s derivation of ([TJ starts with considering a state 
y/ just after the measurement of the position observable q to 
obtain the outcome q' with mean error e{q) and consider what 
relation holds between e{q) and e{p) if the momentum ob¬ 
servable p has been measured simultaneously to obtain the 
outcome p' with mean error e{p). Then, by (AR) or Eq. dH) 
the state yt should have the position standard deviation G{q) 
satsifying 


<y{q) < e{q)- 


(9) 


Heisenberg actually supposed that the state y/ is a Gaussian 
wave function^ (p. 69) 


¥{q) 


1 

{nqly/^ 


exp 


iq-q'y 

2q\ 


-^p\q-q') 


( 10 ) 


which is later known as a minimum uncertainty wave packet, 
with its Fourier transform 


w{p) 


1 


{np\yi‘^ 


exp 


{p-p'Y 

2p\ 


jq\p-p') 


( 11 ) 


and he proved relation (l2]i for the state y/ given by Eq. (fTOl) . 

Exactly this part of Heisenberg’s argument was generalized 
by Kennard2.to prove relation Q for any vector state y/. Thus, 
Kennard^ relaxed Heisenberg’s assumption on the state y/ to 
the assumption that the state yt after the position measurement 
can be arbitrary wave function y/ satisfying Eq. (0. Then, if 
the momentum observable p has been measured simultane¬ 
ously to obtain the outcome p' with an error £(/?), by (AR) or 
Eq. (l8]l again the state yt should also satisfy the relation 


a[p)<e[p). (12) 

Therefore, Heisenberg’s uncertainty relation (IHi immediately 
follows from Kennard’s relation (0. 

As above Heisenberg in 1927 not only derived relation ([T]i 
by the y-ray thought experiment but also gave its mathemati¬ 
cal proof. However, he supposed the repeatability hypothesis 


or its approximate version as an additional but obsolete as¬ 
sumption in addition to the standard postulates for quantum 
mechanics. 

The approximate repeatability hypothesis (AR) has not 
been explicitly formulated in the literature, but in the follow¬ 
ing explanation on the derivation of the uncertainty principle 
von Neumann^ (pp. 238-239) assumed (AR): 

We are then to show that if Q^P are two canon¬ 
ically conjugate quantities, and a system is in a 
state in which the value of Q can be given with the 
accuracy £ (i.e., by a 2 measurement with an er¬ 
ror range £), then P can be known with no greater 
accuracy than TJ —h/{2e). Or: a measurement of 
Q with the accuracy £ must bring about an inde¬ 
terminacy TJ = h/{2e) in the value of P. 

In the above, it is obviously assumed that a state with the posi¬ 
tion standard deviation £ is resulted by a 2 measurement with 
an error range £. This assumption is what we have generally 
formulated in Eq. ® as an immediate logical consequence of 
(AR). 

Two inequalities 0 and (HI are often distinguished as the 
preparational uncertainty relation and the measuremental un¬ 
certainty relation, respectively. However, under the repeata¬ 
bility hypothesis such a distinction is not apparent, since a 
measurement is required to prepare the state with a sharp 
value of the measured observable. In fact, the above argument 
shows that there exists an immediate logical relationship be¬ 
tween those two inequalities. 


V. ABANDONING THE REPEATABILITY HYPOTHESIS 

The repeatability hypothesis explains only a restricted class 
of measurements and does not generally characterize the state 
changes caused by quantum measurements. In fact, there 
exist commonly used measurements of discrete observables, 
such as photon counting, that do not satisfy the repeatability 
hypothesisi^ Moreover, it has been shown that the repeata¬ 
bility hypothesis cannot be generalized to continuous observ¬ 
ables in the standard formulation of quantum mechanicsi^— . 
In 1970, Davies and Lewises proposed abandoning the re¬ 
peatability hypothesis and introduced a new mathematical 
framework to treat all the physically realizable quantum mea¬ 
surements; 

One of the crucial notions is that of repeatabil¬ 
ity which we show is implicitly assumed in most 
of the axiomatic treatments of quantum mechan¬ 
ics, but whose abandonment leads to a much 
more flexible approach to measurement theory^S 
(p. 239). 

Denote by Tc{Jif) the space of trace class operators on 
by the space of density operators on Jlf, and 

by P{Tc{Jlf)) the space of positive linear maps on zc{J^). 
Davies and Lewises introduced a mathematical notion of in¬ 
strument as follows. A Davies-Lewis (DL) instrument for 
(a system S described by) a Hilbert space is defined as 
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a P(Tc(c^))-valued Borel measure on R countably addi¬ 
tive in the strong operator topology such that J^(R) is trace¬ 
preserving (Tr[j^(R)p] = Tr[p]). 

Let A(x) be a measuring apparatus for S with the output 
variable x. The statistical properties of the apparatus A(x) are 
determined by (i) the probability distribution Pr{x G A||p} of 
the outcome x in an arbitrary state p, and (ii) the state change 
p —i> P{xga} from the state p just before the measurement to 
the state P{xga} just after the measurement given the condition 
X € A. The proposal of Davies and Lewis^ can be stated as 
follows. 

(DL) The Davies-Lewis thesis. For every measuring ap¬ 
paratus A(x) with output variable x there exists a unique DL 
instrument satisfying 


Pr{xGA||p} 

= Tr[./(A)p], 

(13) 


jC{A)p 

(14) 

P P{xeA} 

~ Ti[j^(A}p]- 


For any A € .^(R), define n(A) by 

U{A)=y{A)*l, (15) 

where JL{A)* is the dual map of ^{A) given by 
Tr[(j^(A)*A)p] = Tr[A(j^(A)p)] for all A e Then, 

the map A n(A) is a probability operator-valued measure 
(POVM)^, called the POVM of satisfying 

Pr{xeA||p} = Tr[n(A)p] (16) 

for all p e and A € m{R). 

The problem of mathematically characterizing all the physi¬ 
cally realizable quantum measurements is reduced to the prob¬ 
lem as to which instruments are physically realizablei^. To 
settle this problem, standard models of measuring processes 
were introduced ii>i^ as follows. A measuring process for 
(a system described by) a Hilbert space Jif is defined as a 
quadruple consisting of a Hilbert space a 

density operator po on a unitary operator U on Jif 0 
and a self-adjoint operator M on A measuring process 
^P{),U,M) is said to be pure if po is a pure state, and it is 
said to be separable if Jfy is separable. 

The measuring process ,po,U,M) mathematically 
models the following description of a measurement. The mea¬ 
surement is carried out by the interaction, referred to as the 
measuring interaction, between the object S and the probe P. 
The probe P is described by the Hilbert space JF and prepared 
in the state po just before the measurement. The time evolu¬ 
tion of the composite system P + S during the measuring in¬ 
teraction is described by the unitary operator U. The outcome 
of the measurement is obtained by measuring the observable 
M called the meter observable of the probe P just after the 
measuring interaction. We assume that the measuring interac¬ 
tion turns on at time f = 0 and turns off at time t = At. In the 
Heisenberg picture, we write 

Ai(0)=Ai(g)l, A2(0) = l(g)A2, Ai2(Ar) = f/Ui2(0)C/, 


for an observable Ai of S, an observable A 2 of P, and an ob¬ 
servable A 12 (0) of S + P. 

Suppose that the measurement carried out by an appara¬ 
tus A(x) is described by a measuring process {Jjf,po,U,M). 
Then, it is showi>i^ that the statistical properties of the appa¬ 
ratus A(x) is given by 


Pr{xeA||p} = Tr[£"(^)(A)(p®po)], 

_ Tr^[£^(^)(A)(p0po)] 
P P{xgA} (A) (p (g) po)] 


(17) 

(18) 


where Trj^ stands for the partial trace on the Hilbert space 
Jjf. The POVM n of the apparatus A(x) is defined by 


n(A) = Trjr[£"'^>(A)(lgpo)] (19) 


for any A G .^(R). Then, the map A —n(A) is a probability 
operator-valued measure (POVM)^ satisfying 


Pr{xG A||p}=Tr[n(A)p] (20) 


for all p G y’(Jf’) and A G ^(R). 

Now it is easy to see that the above description of the mea¬ 
surement statistics of the apparatus A(x) is consistent with the 
Davies-Lewis thesis. In fact, the relation 


,/(A)p = Tr^ [(1 g£"(^)(A)) (p gpo) 


( 21 ) 


defines a DL instrument J^. In this case, we say that 
the instrument is realized by the measuring process 
{j(r,Po,u,M). 

A DL instrument for Jif is said to be completely positive 
(CP) if J^(A) is completely positive for every A G ^{R), i.e., 
JC{A) gid„ : Tc{J^) ®M„ —>■ xc{Mf) ®M„ is a positive map 
for every finite number n, where Mn is the matrix algebra of 
order n and id„ is the identity map on M„. The following theo¬ 
rem characterizes the physically realizable DL instruments by 
completely positivit y^^i^^ . 


Theorem 1 (Realization theorem for CP instruments). A 

DL instrument can be realized by a measuring process if and 
only if it is completely positive. In particular, every CP instru¬ 
ment can be realized by a pure measuring process, and if 
is separable, every CP instrument for Jif can be realized by a 
pure and separable measuring process. 


Now, we have reached the following general measurement 
axiom, abandoning Axiom 4 or the repeatability hypothesis. 

Axiom 5 (General measurement axiom). To every mea¬ 
suring apparatus A(x) with output variable x there exists a 
unique CP instrument satisfying Eqs. m and (O- Con¬ 
versely, to every instrument J' there exists at least one mea¬ 
suring apparatus A(x) satisfying Eqs. (1Z21> and m- 


VI. VON NEUMANN’S MODEL OE POSITION 
MEASUREMENT 

Let A and B be observables of a system S described by a 
Hilbert space J(f. Let A(x) be a measuring apparatus for S 
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with the output variable x described by a measuring process 
M = ,Pq,U^M) from time t = 0 to t = At. An approxi¬ 
mate simultaneous measurement of A(0) and B{Q) is obtained 
by direct simultaneous measurement of commuting observ¬ 
ables M{At) and B{At), where M{At) is considered to ap¬ 
proximately measure A(0) and B{At) is considered to approx¬ 
imately measure B{Q). In this case the error of the B{Q) mea¬ 
surement is called the disturbance of B caused by the mea¬ 
suring process M, and the relation for the errors of the A(0) 
measurement and the B{Q) measurement is called the error- 
disturbance relation (EDR). In what follows, we examine the 
EDR for position measurement error and momentum distur¬ 
bance. 

Until 1980’s only solvable model of position measure¬ 
ment had been given by von Neumann^.. We show that this 
long-standing model satisfies Heisenberg’s error-disturbance 
relationii, a version of Heisenberg’s uncertainty relation (|4]i. 

Consider a one-dimensional mass S, called an object, with 
position X and momentum px, described by a Hilbert space 
Jif = L^(R;c), where Rjc is a copy of the real line. The object 
is coupled from time f = 0 to f = Af with the probe P, another 
one-dimensional mass with position y and momentum py, de¬ 
scribed by a Hilbert space Jjf = L^(Rv), where R,. is another 
copy of the real line. The outcome of the measurement is ob¬ 
tained by measuring the probe position y at time t = At. The 
total Hamiltonian for the object and the probe is taken to be 

^s+p=^s+^p + ^^, (22) 

where and Hp are the free Hamiltonians of S and P, respec¬ 
tively, H represents the measuring interaction. The coupling 
constant K satisfies KAt = 1 and it is so strong (A" ^ 1) that 
Hg and Hp can be neglected. 

The measuring interaction H is given by 


H =X®Py, 


(23) 


By a property of convolution, if the probe probability distribu¬ 
tion |i^(y)P approaches to the Dirac delta function 5(y), the 
output probability approaches to the Born probability distri¬ 
bution |v/(x)p. 

The corresponding instrument is given by 

y{A)p= [ ^{yl-x)p^iyl-x)Uy, (27) 

JA 

and the corresponding POVM is given by 

UiA)=Jj^{yl-x)\^dy, (28) 

Solving the Heisenberg equations of motion, we have 


x(Af) = x(0), (29) 

y(Af) = x(0)+y(0), (30) 

pxiAt) = pxi0)-py{0), (31) 

Py{At) = py{0). (32) 


VII. ROOT-MEAN-SQUARE ERROR AND DISTURBANCE 

To define the “mean error” of the above position measure¬ 
ment, let us recall classical definitions. Suppose that a quan¬ 
tity A = X is measured by directly observing another quantity 
Y =y. For each pair of values (X,Y) = (x,y), the error is de¬ 
fined as y — X. To define the “mean error” given the joint prob¬ 
ability distribution (JPD) {dx,dy) of X and Y, Gauss^ 
introduced the root-mean-square (rms) error eoiXjY) of T 
for A as 

eGiX,Y)=(^jj^^{y-xfp^’\dx,dy)^ ' , (33) 


so that the unitary operator of the time evolution of S + P from 
f = 0 to f = T < Af is given by 

t/(T) = exp 0 py^ . (24) 

Suppose that the object S and the probe P are in the vector 
states \j/ and respectively, just before the measurement. We 
assume that the wave functions tff(x) and ^ (y) are Schwartz 
rapidly decreasing functions^. Then, the time evolution of 
S + P in the time interval (0, Af) is given by the unitary opera¬ 
tor U(At) = Thus, this measuring process is repre¬ 

sented by (L^(Rv), |^)(^ 

The state of the composite system S 4- P just after the mea¬ 
surement is t/(Af)v/0^. By solving the Schrodinger equa¬ 
tion, we have 

U(At)(y/<Si^)(x,y) = tl/(x)^(y-x). (25) 

From this, the probability distribution of output variable x is 
given by 

Pr{xGA||vf}= [ dy [ \\ir{x)\^\^{y-x)\^dx. (26) 

JA Jr 


which Gauss^l called the “mean error” or the “mean error to 
be feared”, and has long been accepted as a standard definition 
for the “mean error”. 

In the von Neumann model, the observable x(0) is mea¬ 
sured by directly observing the meter observable y{At). Since 
x(0) and y(Af) commute by Eq. (l30l l. we have the JPD 
(dx,dy) ofx(O) andy(Af) as 

^clx,dy) = {E^^^\dx)Ey^'^\dy)), (34) 

where (• • •) stands for the mean value in the state ^. Then, 
by Eq. ( l33l l the rms error e(x, i//) for measuring x in state \j/ is 
defined as the rms error eG(x{Q),y{At)) of y(Af) for x(0), so 
that we have 

e(x,Vf) = (^jj^^{y-x)^p^^°^’y''^'>idx,dy)'^ 

= ((y(Af)-x(0))2)i/2 = (55(0)2) 1/2. (35) 

Since pxiO) and px{At) also commute from Eq. (13X1 1. we 
also have the JPD ((ix,dy) of the values of pxiO) 

and px(At). The rms disturbance t](px, W) of Px in state y/ is 
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similarly defined as the rms error eG{px{0),Px{^)), so that we 
have 

= ((/5,(Ar)-Av(0))2)*/" = (/5,(0)2)>/2. (36) 

Then, by Kennard’s inequality (O we have 

e{x,\i/)ri{px,\i/) = {y{0)y^^{py{0)y^^ 

> a{y{Q))a{py{Q))>^^. (37) 

Thus, the von Neumann model satisfies Heisenberg’s error- 
disturbance relation (EDR) 

£{x)riipx)>^^ (38) 

for e(x) = e{x,\i/) and ri(px) = rj{px,\l/). 

By the limited availability for measurement models up to 
the 1980’s, the above result appears to have enforced a pre¬ 
vailing belief in Heisenberg’s EDR ( l38b . for instance, in 
claiming the standard quantum limit for gravitational wave 
detection2^“—. 


VIII. MEASUREMENT VIOLATING HEISENBERG’S EDR 

In 1980, Braginsky, Vorontsov, and Thorna^ claimed that 
Heisenberg’s EDR ( l38T l leads to a sensitivity limit, called the 
standard quantum limit (SQL), for gravitational wave detec¬ 
tors exploiting free-mass position monitoring. Subsequently, 
Yuen^i questioned the validity of the SQL, and then Caves^ 
defended the SQL by giving a new formulation and a new 
proof without directly appealing to Heisenberg’s ERD (l3^ . 
Eventually, the conflict was reconcile d^^’^*^ by pointing out 
that Caves^Z still supposed (AR), in spite of avoiding Heisen¬ 
berg’s ERD (l38l) . More decisively, a solvable model of a pre¬ 
cise position measurement was also constructed that breaks 
the SQL^^iZS; later this model was shown to break Heisen¬ 
berg’s EDR (l38l>Zi. 

In what follows, we examine this model, which modifies the 
measuring interaction of the von Neumann model. In this new 
model, the object, the probe, and the probe observables, the 


coupling constant K, and the time duration Af are the same as 
the von Neumann model. The measuring interaction is taken 
tobe^Z 

71 

H = — —{2x<^py — 2px®y-\-xpjc® 1 - 1 

3v 3 

. (39) 

The corresponding instrument is give byiZ 


J{A)p= [ e-“'^-|0)(0|e-“P-Tr[£^(£/x)p], 
Ja 

(40) 

where 0(x) = ^(—x), and the corresponding POVM 
by 

is given 

n(A) =£"'(A). 

(41) 


Solving the Heisenberg equations of motion, we have 


x(Af) = x(0)-y(0), (42) 

y{At) = x(0), (43) 

Px{At) = -py{0), (44) 

PyiAt) = Px{0)+Py{0)- (45) 

Thus, x(0) andy(Ar) commute and also px(0) and px{^t) 
commute, so that the rms error and the rms disturbance are 
well defined by their JPDs, and given by 

e(x,v/) = 0, (46) 

riiPx^V) = {iPyiO)+PxiO))Y^^ (47) 

Consequently, we have 

e{x)r]{px) = 0. (48) 


Therefore, this model obviously violates Heisenberg’s EDR 

(El. 


IX. UNIVERSALLY VALID ERROR-DISTURBANCE 
RELATION 

To derive a universally valid EDR, consider a measuring 
process M = (J^,po,t/,M). If A(0) andM(Af) commute, the 
rms error of the measuring process M for measuring A in p 
can be defined through the JPD of A(0) and M{Al). Similarly, 
if B{Q) and B{At) commute, the rms disturbance can also be 
defined through the JPD of B{Q) and B{At). In order to extend 
the definitions of the rms error and disturbance to the general 
case, we introduce the noise operator and the disturbance op¬ 
erator. 

The noise operator N{A) is defined as the difference 
M{At) — A(0) between the observable A(0) to be measured 
and the meter observable M(Af) to be read and the disturbance 
operator D{A) is defined as the the change B{At) —5(0) of B 


caused by the measuring interaction, i.e., 

N{A) = M(Af)-A(0), (49) 

D{B) = B{At)-B{0). (50) 

The mean noise operator n{A) and the mean disturbance op¬ 
erator d{B) are defined by 

«(A) = Tr^[V(A)l0po], (51) 

diB) = Tr^[D(5)l®po]. (52) 

The rms error e(A,p) and the rms disturbance rj{B,p) for 
observables A,5, respectively, in state p are defined by 

e(A,p) = (Tr[A(A)V®Po])'/^ (53) 

77(5,p) = (Tr[D(5)V(g)po])'/^ (54) 


An immediate meaning of e(A,p) and rj{B,p) are the rms’s 
of the noise operator and the disturbance operator. 
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Suppose thatM(Af) and A(0) commute in p 0 po, i.e., 

[£^W(A),£"(^)(r)]p0po = O (55) 

for all A,r G In this case, the relation 

p^(0)’"(A')(c/x,<iy) = TT[E'^^^\dx)E^^^'>{dy)p(g)po] (56) 
defines the JPD of A(0) andM(At) satisfying 

Tr[p(A(O),M(Ar))p0po] = JJ^^p{x,y) 

(57) 

for any real polynomial p(A(0),M(Ar)) inA(O) andM(Af)^. 
Thus, the classical rms error ec{A{0),M{Al)) of M{At) for 
A(0) is well defined, and we easily obtain the relation 

e(A,p) = eG(A(0),M(Af)). (58) 

Similarly, we have ri{B,p) = ec{B{0),B{At)) if B{0) and 
B{At) commute in p ® po- 

In 2003, the present autho r* derived the relation 

e(A)77(B) + |([n(A),B]) + ([A,uf(B)])| > 1 |([A,B])|, (59) 

where e(A) = e(A,p), 77 (B) = 77 ( 5 ,p), which is universally 
valid for any observables A,B, any system state p, and any 
measuring process M. From Eq. ( l59b , it is concluded that if 
the error and the disturbance are statistically independent from 
system state, then the Heisenberg type EDR 

e(A)77(B)>^|([A,B])| (60) 

holds, extending the previous results^—. The additional cor¬ 
relation term in Eq. ( l59b allows the error-disturbance product 
e{A)ri{B) to violate the Heisenberg type EDR (I 6 OI 1 . In gen¬ 
eral, the relation 

e(A)J 7 (B) + e(A)( 7 (B) + c 7 (A) 77 (B) > i|([A,B])|, (61) 

holds for any observables A,B, any system state p, and any 
measuring process ]Vtii ~'3,35i4Qi^* . 

The new relation (IhTT i leads to the following new constraints 
for precise measurements and non-disturbing measurement; 
then 

ct(A) 77(B) > ^|([A,B])|, ife(A)=0, (62) 

£(A)(j(B) > ^|([A,B])|, if 77 (B) =0. (63) 

Note that if ([A,B]) 7 ^ 0, the Heisenberg type EDR (I 6 OI 1 leads 
to the divergence of e(A) or 77 (B) in those cases. The new 
error bound Eq. (l 6 ^ was used to derive a conservation- 
law-induced limits for measurementsi^f^"^ quantitatively 
generalizing the Wigner-Araki-Yanase theorem^"^ and was 
used to derive a fundamental accuracy limit for quantum 
computingi^. 


X. QUANTUM ROOT MEAN SQUARE ERRORS 

We say that the measuring process M is probability repro¬ 
ducible for the observable A in the state p iff 

Tr[£M{Af) ^ 'pr[£'^ (A)p] (64) 

holds for all A G 3§{K). The rms error e{A,p) satisfies that 
p(A,p) = 0 for all p if and only if M is probability repro¬ 
ducible for A in all p *^'^* . Thus, the condition that e(A, p) = 0 
for all p characterizes the class of measurements with POVM 
n satisfying H = 

Busch, Heinonen, and Lahti^ pointed out that there are 
cases where e(A,p) =0 holds but M is not probability re¬ 
producible and where M is not probability reproducible but 
e(A, p) = 0 holds, and questioned the reliability of the rms er¬ 
ror e (A.p) as a state-dependent error measure. However, their 
argument lacks a reasonable definition of precise measure¬ 
ments, necessary for discussing the reliability of error mea¬ 
sures. In response to their criticism, the present autho r^^i^'* 
has successfully characterized the precise measurements of A 
in a given state p and shown that the rms error e(A,p) re¬ 
liably characterizes such measurements. In what follows we 
survey those results, which were mostly neglected in the re¬ 
cent debates^—. 

Let us start with the classical case. Suppose that a quantity 
A = X is measured by direct observation of another quantity 

Y = y. Then, this measurement is precise iff A =Y holds with 
probability 1, or equivalently the JPD {dx,dy) of A and 

Y concentrates on the diagonal set, i.e., 

IxT^y}) = 0. (65) 

As easily seen from Eq. (l33T l. this condition is equivalent to 
the condition edXjY) = 0 . 

Generalizing the classical case, we say that a measuring 
process M makes a strongly precise measurement of A in p iff 
A(0) =M(Af) holds with probability 1 in the sense that A(0) 
and M{At) commute in p 0 po and that the JPD 
concentrates on the diagonal set, i.e., 

/{0)’"(A') ({(x,y) G R" I X 74 3 ,}) = 0. ( 66 ) 

On the other hand, we have introduced another operational 
requirement. The weak joint distribution of A(0) 

andM(Af) in a state p is defined by 

^A(0),M(A/) ^ Tr[£'^(‘’) {dx)E^^'^'l {dy)p (g) po]. (67) 

The weak joint distribution is not necessarily positive but 
operationally accessible by weak measurement and post- 
selection^. We say that the measuring process M makes a 
weakly precise measurement of A in p iff the weak joint dis¬ 
tribution p^(**)’***(^) in state p concentrates on the diagonal 
set, i.e., 

^A{0).M(A/) g r2 I ^ ^ ^ Q 

This condition does not require that A(0) and M{At) com¬ 
mute, while it only requires that the weak joint distribution 
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concentrates on the event A(0) = M{At). A similar condi¬ 
tion has been used to observe momentum transfer in a double¬ 
slit ‘which-way’ experiment^^i^. We naturally consider that 
strongly preciseness is a sufficient condition for precise mea¬ 
surements and weak preciseness is a necessary condition. In 
the previous investigation a^^i^"* , it was mathematically proved 
that both conditions are equivalent. Thus, either condition is 
concluded to be a necessary and sufficient condition charac¬ 
terizing the unique class of precise measurements. As above, 
we say that the measuring process M precisely measures A in 
p iff it makes a strongly or weakly precise measurement of A 
in p. 

To characterize the class of precise measurements in terms 
of the nns error-freeness condition, £ (A, p) =0, and the prob¬ 
ability reproducibility condition, we introduce the follow¬ 
ing notions. The cyclic subspace '^{A,p) generated by A 
and p is defined as the closed subspace of Jif generated by 
{E^{A)<1) I A G G ran(p)}, where ran(p) denotes the 

range of p. Then, the following theorem hold a^^'^"* . 

Theorem 2. Let M = ,Pq,U ,M) be a measuring process 
for the system S described by a Hilbert space Jif. Let A be 
an observable of S and p a state o/S. Then, the following 
conditions are equivalent. 

(i) M precisely measures A in p. 

(ii) £(A,0) = 0 in all 0 G ^{A,p). 

(iii) M is probability reproducible for A in all 0 G ^{A,p). 

In the case where A(0) and M{At) commute, precise mea¬ 
surements are characterized by the rms error-freeness condi¬ 
tion, since in this case we have £G(A(0),M(Af)) = e{A,p). 
However, the probability reproducible condition does not 
characterize the precise measurements even in this case. To 
see this suppose that A(0) and M{At) are identically dis¬ 
tributed and independent^ (p. 763). Then, we have 

£G(A(0),M(At)) = jj^p-xfp^^^\dx)p^^^'\dy) 

= (t(A(0))2 + a{M{At)f + ((A(0)) - {M{At)))\ 

Since C7(A(0)) = G{M{At)) and (A(0)) = (M(Af)), we have 

£G(A(0),M(Af)) = V2a(A). (69) 

Thus, M is not a precise measurement for the input state p 
with <7(A) f 0. In the case where A(0) and M{Al) do not 
commute, the rms error-freeness condition well characterizes 
precise measurements to a similar extent to the probability re¬ 
producibility condition. In particular, the class of measuring 
processes precisely measuring A in all p is characterized by 
the following equivalent conditions^^*^: (i) e(A,y/) =0 for 
all y/ G (ii) probability reproducible for A in all yr G Jif; 
(iii) Yl — E^. The above result ensures our long-standing be¬ 
lief that a measurement with POVM TI satisfying TI = is 
considered to be precise in any state in the sense that the mea¬ 
sured observable A(0) and the meter observable M{At) to be 


directly observed are perfectly correlated in any input state, 
not only reproducing the probability distribution in any state. 

We say that the measuring process M does not disturb an 
observable B in a state p iff observables B{Q) and B{At) com¬ 
mute in the state p 0 po and the JPD of B{0) and 

B{At) concentrates on the diagonal set. The non-disturbing 
measuring processes defined above can be characterized anal¬ 
ogously. 

From the above results, a non-zero lower bound for £(A) or 
77 (B) indicates impossibility of precise or non-disturbing mea¬ 
surement. In particular, if ( 7 (A), ( 7 (B) < 00 and ([A,B]) fO, 
then any measuring process cannot precisely measure A with¬ 
out disturbing B. 

The above characterizations of precise and non-disturbing 
measurements suggests the following definitions of the locally 
uniform nns error e{A,p) and the locally uniform rms distur¬ 
bance rj{B,p)^: 

e(A,p) = sup £(A,0), (70) 

rfiB,p) = sup 77 (B, 0 ). (71) 

Then, we have e{A,p) — 0 if and only if the measurement 
precisely measures A in p, and that rf{B,p) = 0 if and only if 
the measurement does not disturb B in p. For those quantities, 
the Heisenberg type EDR 

^ (72) 

is still violated by a linear position measurement^, and the 
relation 

e{A)lJ{B) + e{A)a{B) + a{A)rf{B) > ^K[A,B])| (73) 

holds universally^, where £(A) = e{A,p) and T[{B) = 
B{B,p). 

Thus, the locally uniform rms error e{A,p) completely 
characterizes precise measurements of A in p and the lo¬ 
cally uniform rms disturbance rf{B,p) completely character¬ 
izes measurements non-disturbing B in p, while they satisfy 
the EDR of the same form as the rms error and disturbance. 
Eurther investigations on quantum generalizations of the clas¬ 
sical notion of root-mean-square error and EDRs formulated 
with those quantities will be reported elsewhere. 
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